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BAYESIAN SEQUENTIAL TESTING OF THE DRIFT OF A 

BROWNIAN MOTION 


ERIK EKSTROM^ AND JUOZAS VAICENAVICIUS 

Abstract. We study a classical Bayesian statistics problem of sequen¬ 
tially testing the sign of the drift of an arithmetic Brownian motion with 
the 0-1 loss function and a constant cost of observation per unit of time 
for general prior distributions. The statistical problem is reformulated 
as an optimal stopping problem with the current conditional probability 
that the drift is non-negative as the underlying process. The volatility 
of this conditional probability process is shown to be non-increasing in 
time, which enables us to prove monotonicity and continuity of the op¬ 
timal stopping boundaries as well as to characterize them completely in 
the finite-horizon case as the unique continuous solution to a pair of inte¬ 
gral equations. In the infinite-horizon case, the boundaries are shown to 
solve another pair of integral equations and a convergent approximation 
scheme for the boundaries is provided. Also, we describe the dependence 
between the prior distribution and the long-term asymptotic behaviour 
of the boundaries. 


1. Introduction 

One of the classical questions in Sequential Analysis concerns the test¬ 
ing of two simple hypotheses about the sign of the drift of an arithmetic 
Brownian motion. More precisely, suppose that an observed process Xt is 
an arithmetic Brownian motion 

Xt = Bt + Wt, 

where the constant B is unknown and IT is a standard driftless Brown¬ 
ian motion. Based on observations of the process X, one wants to test 
sequentially the hypotheses Hq '■ B < 0 and ffi : B > 0. In the Bayesian 
formulation of this sequential testing problem, the drift R is a random vari¬ 
able with distribution //, corresponding to the hypothesis tester’s prior belief 
about the likeliness of the different values B may take. Moreover, it is as¬ 
sumed that B and W are independent. In this article, we consider a classical 
formulation of the problem in which the accuracy and urgency of a decision 
is governed by a 0-1 loss function together with a constant cost c > 0 of 
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observation per unit time. The 0-1 loss function means that the tester gains 
nothing for a right decision but pays a penalty of size 1 for being wrong. 
The overall goal is to find a decision rule minimising the expected total cost 
(provided such a decision rule exists). If the decision is required to be made 
before a fixed predetermined time T > 0, the problem is said to have a finite 
horizon, and if there is no upper bound on the decision time, an infinite 
horizon. 

In the classical literature [5] by Chernoff and [2] by Bather on Bayesian 
sequential testing procedures for the sign of a drift, the special case of a 
normal prior distribution is studied. While Bather considers the 0-1 loss 
function described above as well as a few other alternatives, Chernoff deals 
with a different penalty function, which equals the magnitude \B\ of the 
error. In these papers, it is argued that the sequential analysis problem 
reduces to a free-boundary problem for a function of time and the current 
value of the observation process, but, as in the case of most time-dependent 
free-boundary problems, the free-boundary problem lacks an explicit solu¬ 
tion. Instead, the focus of these and many follow-up articles in the area, 
including 13], i, [7|, and m to mention a few, is on asymptotic ap¬ 
proximations for optimal stopping boundaries (for more references, see the 
survey article [l6|). Only recently, [22| has characterised the optimal stop¬ 
ping boundaries for the original Chernoff’s problem in terms of an integral 
equation, which can be solved numerically. 

In |2I) . the sequential testing problem is solved explicitly for a two- 
point prior distribution by utilising the connection with a time-homogeneous 
free-boundary problem. Notably, the natural spatial variable in this free¬ 
boundary problem is not the value of the observation process, but the con¬ 
ditional probability of the drift taking one of the two possible values. (Since 
there is a one-to-one correspondence between these two processes at each 
fixed time, the free-boundary problem could be transformed into one based 
on the observation process instead, but that formulation would introduce 
time-dependencies and thus make the explicit solution more difficult to find.) 

The fact that the problem can be solved in a very special case of a 
two-point prior, raises a natural question - can the sequential testing prob¬ 
lem be solved for a more general prior distribution? In this article, we in¬ 
vestigate the sequential testing problem under a general prior distribution. 
Since this introduces time-dependencies in the problem, there is generally 
no hope for explicit solutions. Nevertheless, additional structure is found, 
which enables us to arrive at a fairly satisfactory answer. 

To explain in some further detail, following standard arguments, the sta¬ 
tistical problem is shown to admit an equivalent formulation as an optimal 
stopping problem, which we study to characterise optimal decision rules. 
The underlying process of the optimal stopping problem is chosen to be 
the current probability, conditional on observations of X, that the drift is 
non-negative, i.e. 


Ut :=¥{B > 0|Tf). 


BAYESIAN SEQUENTIAL TESTING OF THE DRIFT OF A BROWNIAN MOTION 3 


The pay-ofF function of the associated optimal stopping problem is then 
concave in n, so general results about preservation of concavity for optimal 
stopping problems may be employed to derive structural properties of the 
continuation region. Moreover, the volatility of the underlying process If 
can be shown to be decreasing in time (except for the two-point distribution 
discussed above, in which case it is constant). These important facts allow us 
to show that the optimal stopping boundaries are monotone, so techniques 
from the theory of free-boundary problems with monotone boundaries can 
be applied. In particular, the monotonicity of the boundaries enables us to 
prove the smooth-ht condition and the continuity of the boundaries, as well 
as to study the corresponding integral equations. In the finite-horizon case, 
we characterise the optimal boundaries as a unique continuous solution to 
a pair of integral equations. In the infinite-horizon case, the situation turns 
out to be more subtle. The boundaries are shown to solve another pair of 
integral equations, but whether the system admits a unique solution remains 
unanswered. Instead, we provide a converging approximation scheme for the 
optimal stopping boundaries, establishing that the optimal boundaries of the 
hnite-horizon problem converge pointwise to the optimal boundaries of the 
infinite-horizon problem. Also, we determine the long-term asymptotes of 
the boundaries and describe their dependence on the prior distribution. 

From a technical perspective, we tackle a number of issues stemming from 
the infinite-dimensionality of the parameter space of the underlying process 
n, the particular form of the unbounded payoff function, as well as the 
presence of time-dependent inhnite-horizon boundaries. Filtering and an¬ 
alytic techniques are used to understand the behaviour of the conditional 
probability 11, with a particular focus on the properties that are invariant 
under any prior distribution. Also, the generality of the prior makes the 
verification of the smooth-ht condition more involved than in standard situ¬ 
ations. Moreover, the specihc form of the payoff function with the additive 
unbounded time term requires some additional effort to prove optimality of 
the hitting time in the inhnite-horizon case. Our approach to approximate 
the optimal inhnite-horizon boundaries could possibly be utilised in other 
similar situations. 

Although we manage to solve this classical sequential hypothesis testing 
problem for the sign of the drift of a Brownian motion in the general prior 
case, further generalisations appear to be challenging. The assumption that 
the observation process is a Brownian motion gives a Markovian structure to 
the problem in the sense that the posterior distribution then only depends 
on the current value of the observation process. In contrast, if the obser¬ 
vation process was a more general diffusion process, possibly involving time 
and level-dependent coefficients, then the posterior distribution of the drift 
would depend on the whole observation path, and thus our one-dimensional 
Markovian structure would be lost. Another noticeable limitation is the 
hxed 0-1 loss function, which allows us to use a convenient process 11 as the 
underlying diffusion. For more general loss functions, the natural process 
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to express the stopping problem in terms of is the underlying observation 
process itself. However, then the corresponding boundaries are not necessar¬ 
ily monotone, so the corresponding free-boundary problem is less tractable. 
For Chernoff’s loss function and the particular case of a normal prior distri¬ 
bution, [22j finds an appropriate scaling and time-change of X in that the 
transformation guarantees the monotonicity of the corresponding stopping 
boundaries. The extension to more general prior distributions and other loss 
functions remains an important and challenging open problem. 

The paper is organised as follows. In Section the sequential testing 
problem is formulated and reduced to an optimal stopping problem. In Sec¬ 
tion filtering techniques are applied to find an expression for H in terms 
of the observation process X, and its dynamics in terms of the innovation 
process are determined. We also study the volatility function of H, and it is 
shown that this function is non-increasing in time. In Section]^ the optimal 
stopping problem is studied together with the corresponding free-boundary 
problem, and it is shown that the optimal stopping boundaries are continu¬ 
ous. In Section integral equations for the boundaries are determined, and 
uniqueness of solutions is established in the finite-horizon case. The long¬ 
term asymptotic behaviour of the infinite-horizon boundaries is presented 
in Section Finally, Section is devoted to a special case of the normal 
prior distribution. 

Acknowledgement. We are most grateful to loannis Karatzas for his sug¬ 
gestion to generalise the setting of an early version of the current paper, and 
for sharing his unpublished notes on a related problem with us. 


2. Problem formulation and reduction to an optimal stopping 

PROBLEM 


Let (n,P,T') be a complete probability space supporting a Brownian mo¬ 
tion W and a random variable B with distribution /i such that W and B 
are independent. Define 

Xt = Bt + Wt. 

Writing for the filtration generated by the process X and the 

null sets in X, our goal is to find a pair (r, d) consisting of an F^-stopping 
time r and an J^.^-measurable decision rule d : D —?• {0,1}, indicating which 
of the hypotheses Ho : B < 0 or Hi : B > 0 to accept, in order to minimise 
the Bayes risk 


R{T,d) + ®'[l{d=o,B>o}] + cE[r]. 

Since d is Ti;?--measurable, we have 

R{T,d) = E [E l{d=o} + ct] , (1) 

which shows that, at a given stopping time r, the decision rule 

ifF(B>0|J'.^) >F(B<0|J-^), 

1 0 otherwise 
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is optimal. Consequently, writing 

Ut := F{B > 0| ), 

the sequential testing problem Q reduces to an optimal stopping problem 

V=mfE[g{Ur)+CT], (2) 

tE / 

where g{7r) = vr A (1 — vr) and T denotes the set of F^-stopping times. We 
also consider the same sequential testing problem but with a finite horizon 
T < oo. The corresponding optimal stopping problem is then 

= inf E[ 5 (n^)+ cr], 
where = {tGT ■ t < T}. 

Remark By translation, our study readily extends to testing the hypotheses 
Hq : B < 8 and Hi : B > 9 for any given 0 G M. The methods also extend to 
the case when the two types of possible errors are associated with different 
costs, i.e. when 

R{T,d) = aE[l{d=i_s<o}] + + cE[r] 

for constants o > 0 and 6 > 0 with a ^ b. For simplicity of the presentation, 
however, we assume throughout the article that 8 = 0 and a = b = 1. 

Note that in the cases when g, ((—oo, 0)) = 0 oi g ([0, oo)) = 0, the sequen¬ 
tial testing problem becomes trivial as we can make the correct statement 
about the sign of the drift at time zero. Hence, from now onwards, we always 
assume that 


0 < g ([0, oo)) < 1. 


( 3 ) 


3. Conditional probability of non-negative drift 

In this section, we derive a filtering equation for the distribution of B 
conditional on the observations of X, which is then applied to prove some 
elementary results concerning the conditional distribution of the sign of B. 
We also show that there is an explicit one-to-one correspondence between 
n and the observation process X at each fixed time, and we determine the 
dynamics of X and H in terms of the innovation process. 


3.1. Filtering of the unknown drift. 


Proposition 3.1. Assume that g : M —)> M satisfies |g(5)|/r(d6) < oo. 
Then 


E[g(H)|J-f] 




( 4 ) 


for any t > 0. 
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Proof. The proof is based on standard methods in filtering theory, see e.g. 
[U Section 3.3], yet we include it for completeness. First define an enlarged 
filtration G = {Gt}o<t<cxi as the completion of {cr{B, Ws '■ 0 < s < t)}o<i<oo- 
Clearly, Tf C for any f > 0, so G is an enlargement of F^. Observing 
that Zt := is a G-martingale, we define a new probability 

measure F* on the restriction (fi, Qt) for some large enough T by 


■= ^T- 


It can be shown that under F*, Xt is a Brownian motion independent of 
Qq and therefore also of B and that the law of B is (see [U Proposition 
3.13]). Thus Bayes’ rule (cf., for example, [IT]) gives that 


K[q{B)\Xi^] 


E, [q{B)/Zt\Xi^] 
E* [l/Zt\Xt^] 




for t > 0 and for any function g : M —)• M with \q{b)\fj.{db) < oo. □ 


3.2. Conditional probability of non-negative drift. According to Propo¬ 


sition 3.1 


ni = E[l[o,oo)(B)|T-/'] =7r{t,Xt), 
where the function 7r(t, x) : (0, oo) x M —>■ (0,1) is given by 


7r(t, x) := 




Denoting by 


/^£,3i(d6) •— 


e^^-^V(db) 

4e^"-^V(d6) 


( 5 ) 


( 6 ) 


the distribution of B at time t conditional on = x, we thus have 

7r(t,x)= / UtM^b). 

J [0,oo) 

Proposition 3.2. Assume that g : M —>■ M is non-decreasing and satisfies 
fu k(^)l/^(d^) < oo. Then the function 


is non-decreasing in x for any fixed t > 0. 


( 7 ) 


Proof. We will prove the claim by showing that u{t, ■) is differentiable with 
a non-negative derivative on M. By a standard differentiation lemma (see, 
for example, m Theorem 6.28]), both the numerator and the denominator 
in 0 are differentiable with respect to x with their derivatives obtained by 
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differentiating under the integral sign. Thus the derivative of u with respect 
to the second argument x is given by 

d 2 u{t,x)= [ q{b)bfj.t,xi<ib) - [ q{b)^it,xi^b) / ( 8 ) 

i/ M. M M. 

= Et^x [q{B)B] - Et^x [q{B)] Et^x [B] , 

where Et^x is the expectation operator under the probability measure := 

P(-|Xi = x). Since 

[q{B)B] - E^,, [(?(S)] Ei,, [B] (9) 

= Et,x [{B - Et,x[B]){q{B) - (z(Et,,[5]))] > 0, 

this finishes the proof. □ 


Corollary 3.3. Let a G M and t > 0. Then 

(1) E {B > a \ Xt = x) is non-decreasing in x, 

(2) E {B < a \ Xt = x) is non-increasing in x. 


Proof. The first claim follows by applying Proposition 3.2 to the function 
q{b) = l(a,oo)(^)- The second claim follows from E [B < a \ Xt = x) = 1 — 
E {B > a \ Xt = x) and by applying Proposition 3.2 to the function q{b) = 

l[a,oo)(^)- n 


Proposition 3.4. For any given t > Q, the function 7r(t, •) : M — >■ (0,1) 
defined in ^ is a strictly increasing continuous bijection. 


Proof. First note that 


{B - Ei^3;[i?])(l[0^oo)(-S) - l[0,oo)(Ti,a;[^])) > 0 

and that 


^t,x {{B - Et,3;[5])(l[0,oo)(-B) 


l[0,oo)(Tt,x[5])) > 0) > 0 


since implies fit,x ((~oo, 0)) > 0 and fit,x ([0, oo)) > 0. Consequently, the 
inequality in Q is strict, so a: i—)■ 7r(t, x) is strictly increasing. 

Next, note that 


7r(t, x) 


1 

1 + A{t, x) ’ 


where 


A{t, x) 




( 10 ) 


By monotone convergence, we find that A(t, x) —)> 0 as x —)• oo and A(t, x) —)> 
oo as X —)> —oo. Consequently, lim^^oo 7 r(t, x) = 1 and lima:-s.-oo 7r(t, x) = 0, 
which finishes the proof. □ 


An immediate consequence of Proposition 3.4 is that for any fixed t > 0, 
the spatial inverse 7r(t, 
denote the inverse by x(t, •). 


( 0 , 1 ) 


exists. To facilitate intuition, we 
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We end this subsection with a result that describes the long-term be¬ 
haviour of the process IT. 

Proposition 3.5. lit IIoo a.s. as t ^ oo, where IIoo is a Bernoulli 
random variable with P(noo = 0) = /u((—oo, 0)) and lP(noo = 1) = /^([O, oo)). 

Proof. Firstly, since lit = IE[l[o^oo)(^) I is a bounded martingale, by the 
martingale convergence theorem, the pointwise limit IIoo := hmt_>.oo lit is a 
well-defined random variable closing the martingale 11. By the law of large 
numbers for Brownian motion and Proposition |6 .1 1 below (the proof of which 
is independent of the current result), for any 6 < 0 in the support of /r we 
have 

P (Boo = 0 \ B = b) = I, 
so 

P(noo = 0)= [ F{Uoc = 0\B = b) fj,{db) = fi{{-oo,0)). 

J (— 00 , 0 ) 

Hence as Hoo can only take values in [0,1], the fact that E[noo] = Epo] = 
/r([0, oo)) implies E (Hoo = 1) = /^([O, oo)). □ 


3.3. SDE for the conditional probability of non-negative drift. As¬ 
suming that B has a first moment, the conditional expectation of B exists 
and is given by 

compare Moreover, the observation process X is represented in terms 
of the innovation process 




Wt ■.= Xt- [ E[H| ] ds 
Jo 
as 

dXt = E[H| ] dt + dWt. 

Here W is a standard E^-Brownian motion (see [H Proposition 2.30 on 
p. 33]). Moreover, writing E'^ = {iF^}t>o for the completion of the filtra¬ 
tion {a{Ws : 0 < s < t)}t>o, we have E^ = (see the remark on p. 35 in 

II])- 

From now onwards, the following integrability condition on /i will be 
imposed throughout the article. 


Assumption. 



/i(d6) < oo 


for some e > 0. 


( 12 ) 


Note that this assumption is a minor restriction on our hypothesis testing 
problem since, given any probability distribution /r, the distributions 
all satisfy (12) for t > 0. In other words, no matter what prior distribution 
p one starts with, the condition (12) will be satisfied after any infinitesimal 
time of observation. Also, note that the assumption allows us to extend the 





BAYESIAN SEQUENTIAL TESTING OF THE DRIFT OF A BROWNIAN MOTION 9 


definition of iJt,x in to t = 0. Moreover, if we have a prior distribution ^ 
on M given by 


e(d6) := 


/jj e^'>V(d6) ’ 


then 


— ^2e,x(d6) .— 


e^"^-^"(2")/2^(d6) 

J-^gte-62(2Q/2^(d5)' 


(13) 


Consequently, the distribution /xo,x can be identified with a conditional dis¬ 
tribution at time 0 given that the prior distribution at time — 2e was ^ and 
the current value of the observation process is x. This gives us a general¬ 
isation of the notion of the starting point of the observation process X to 
allow Xq = X ^ 0, and we may regard time 0 as an interior point of the time 
interval. 

A closer look at the condition (12) and the expression ([^ assures that 
the standard differentiability lemma can be applied to differentiate 7r(t, x) 
with respect to both variables multiple times inside (— 2e, oo) x M. Applying 
Ito’s formula to = Tr{t,Xt), we find that 

dn* = (^di7T{t, Xt) + E[5|Tf ]927r(t, Xt) + dt + a27r(t, Xt) dWt 

= d 2 TT{t,x{t,Ut))dWt, 

where the second equality is verified using the expression © and 

f f f 

di'K{t,x) = - / — ijLt,x{<^b) + / Atf,x(d6) / —/it,3;(d5), 

J[0,oq) ^ J [0,oo) Jm ^ 


d 2 Tr{t, x) 



biJt,x{db) 


bjjt,x{db) 



f^t,x{db), 


(14) 


dl-K{t,x)= / nt,x{dh) - 2 / bnt,x{db) / &//t,x(d6) 

J[0,cio) Jm ^[0,oo) 

- / b‘^fit,xidb) [ iJLt,x{db) + 2 f nt,xidb) ( f bfit,xidb) 
Jm. J[0,oo) J[0,oo) \-/r 

Thus the dynamics of lit are specified by a zero drift and the volatility 


a{t, Ili) = d27r{t, x{t, Ili)), (15) 

being a positive function of the current time and the current value of IT. 
Using (j^, the volatility function can be expressed as 

cr{t,TT) = ,r)[-Sl{B>o}] —^t,x{t,iT) {B > 0) E^ ,r) [-6] 

= (1 - - '^^t,xit,^)i^'^{B<0}]- (16) 
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Example (The two-point distribution). Assume that F{B = ai) = 

I — p and P(i? = 02 ) = p for some constants ai < 0 < 02 and p G ( 0 , 1 ). 
Then 


P(i? = a 2 \Xt = x) = 7r{t,x) 


p^a2X—a^tl2 

(1 _ p'^Qaix-a\t/2 _j_ p^a2X-a\tl2 ’ 


and 


a{t, tt) = (02 - ai) 7 r(l - vr). 

This example with a two-point prior distribution is a special case of the 
Wonham filter. 


Example (The normal distribution). Assume that p is the normal 
distribution with mean m and variance 7 ^. Then the conditional distribution 

' I 2 

P(-|At = x) = pt,x is also normal but with mean and variance 

Consequently, 


l+t7^ 


and 


where 


and 


7r(t, x) = $ 


m + 7 X 


Vi- + ^7 


p{z) = 




(17) 


^{y)= / p{z)dz 


are the density and the cumulative distribution of the standard normal ran¬ 
dom variable, respectively. Note that this instance of a normal prior dis¬ 
tribution is a special case of the well-known Kalman-Bucy filter, see for 
example m Chapter 6 ]. 


3.4. Volatility of the conditional probability process. In this section 
we study the volatility function a. The main result. Corollary 3.8, states 
that the volatility is non-increasing as a function of time. 

Let TT G (0,1) be a fixed number and consider the map x(-, vr) ; [0, 00 ) —>■ M 
sending t 1 —x(t, vr). Note that the graph of this function is the trajectory 
that the process Xt has to follow in order for the conditional probability 
process 11* to stay constant at the value vr. Thus we call x(-, 7 r) the n-level 
curve. Some handy regularity of x(-, •) and o'(-, •) is brought to light in the 
following. 


Proposition 3.6. The functions x(-,-) : [0,oo) x (0,1) —)■ M and cr(-,-) : 
[0, 00) X (0,1) —)• M are both . 
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Proof. Define F : (—e,oo) x M —>■ (—e, oo) x (0,1) by F{t,x) = 

where e is as in the assumption (12). The function T is which is evident 


by applying the standard differentiation lemma as in the proof of Proposition 
The Jacobian matrix of F is 


JF{t,x) = 


1 


di7r{t, x) 


0 

d2TT{t, X) 


Since F is invertible and det(x)) = 527r(t,x) > 0 for all t > —e and all 
X G M, the inverse function theorem tells us that the inverse of F is also C^, 
with 

Jp-i{F{t,x)) = {JF{t,x))~^. 

Consequently, x(-, •) is on (—e, oo) x (0,1) with the derivatives dix{t, vr) = 
—9i7r(t, x(t, 7r))/(927r(t, x(t, vr)) and d2x{t,7r) = l/92'T(t, x(t, tt)). Finally, 
since a product of continuous functions is continuous, by the chain rule, 
f7(-,-) is continuously differentiable on (—e,oo) x (0,1) and so on [0,oo) x 
(0,1). □ 


Next, denoting the initial value Uq by ttq G (0,1), we show that the tails of 
the conditional distribution nt^x are decreasing along the level curve x(-, ttq). 

Proposition 3.7. 

(1) If a > 0, then for any t > 0, 


F {B > a\Xt = x{t, TTo)) — F{B > a) < 0. (18) 

Supposing fj,{{a,oo)) > 0, the inequality above is strict if and only if 
/x([0,a]) > 0. 

(2) Likewise, if a < 0, then for any t > 0, 

F {B < a\Xt = x{t, TTo)) — IP’(-S < a) < 0. (19) 

Supposing a)) > 0, the inequality above is strict if and 

only if /i([a, 0)) > 0. 


Proof. We prove only the first of the two claims as the proof of the second 
one follows the same argument with straightforward modihcations. 


In the case /x((a,oo)) = 0, the claim holds trivially with equality in (18). 


Thus we assume that ^{{a,oo)) > 0 in what follows. Writing x{t) instead 
of x{t,7To) for brevity, we note that using ^.(^^([O,oo)) = /u([0,oo)), the 
inequality (18) is easily seen to be equivalent to 


f(a, 


\ ' 
oo) 




l''(d6) fj.((a,oo)) 


< 


/r([0,oo)) 


(20) 


Now, we will split the proof into consideration of two separate cases. 
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Case 1: a > 2x{t)/t. Here 


/[0,a] 




^i{db) e 
> 


ax{t)—a^ ^ 


/i([0,a]) 


/(a,oo) oo)) 


At([0^«]) 

^((o,oo))’ 


which is equivalent to (20). Since /i((a,oo)) > 0, the inequality above is 
strict if and only if /i([0,a]) > 0. 

Case 2: x{t) > 0 and 0 < a < 2x{t)/t. Using that oo)) = oo)), 

we get 

i[ 0 ,oo) ^((-OO, 0 )) 

< /i([0,oo)), 


where the inequality holds since x{t) > 0. Hence rewriting 

/(a,oo) e^-M-^^*/V(db) 

/x((a,oo)) n{[0,oo)) - n{[0,a]) 

and keeping in mind that 0 < a < 2x{t)/t, one clearly sees that 




/i((a,oo)) 


< 


[0,oo) 


/i([0,oo)) 


( 21 ) 


with the strict inequality if and only if /x([0, a]) >0. As (21) is equivalent 
to (20), the proof is complete. □ 


Corollary 3.8. For any vr G (0,1) fixed, the volatility function it(-, vr) : [0, oo) 
defined in is non-increasing in time. Moreover, it is strietly decreas¬ 
ing for any initial prior // except a two-point distribution in which case 
1 1 —)■ cr(t, vr) is a constant function. 


Proof. A key to the proof is a realisation that it is sufficient to prove that 
< 7 ( 0 , 710 ) > a{s,Tro) for any s > 0; the rest will immediately follow by a 
‘moving-frame’ argument. More precisely, by ‘moving-frame’ we mean that 
for any tt G (0 ,1), t > 0, one can think of tit,x{t,TT) the initial prior 
distribution at time zero and so immediately obtain that a{t, vr) > a{t+s, vr) 
for any s > 0. 

Using a shorthand x{t) for x{t,TTo) as before, recall from that 
fT(t,7ro) = (1 - 7ro)Et_3,(t)[Hl{B>o}] - 7roEi^3,(i) [H1 {b<o}]. 

Consequently, 

< 7 ( 0 ,TTo) - f7(t,7ro) = (1 - vro) (E[H1 {b>o}] - { b > o }]) 

+710 (lE4x(t)[-Sl{s<o}] — IE[H1{5 <o}]) 

> 0 
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by Proposition 3.7 Moreover, by the same proposition, the inequality re¬ 
duces to an equality if and only if is a two-point distribution. □ 

Remark It seems difficult to find an easy intuitive argument for the mono¬ 
tonicity of the volatility function. As an example, consider a symmetric 
prior distribution, and a strictly positive time-point t at which the obser¬ 
vation process satisfies Xt = 0. Then the conditional distribution fit,o is 
also symmetric, so 11* = Ho = 1/2. One certainly expects that nt,o is ob¬ 
tained from the prior distribution /i by pushing mass towards zero (this 
is also verified in Proposition 3.7 above). One could expect that the 11- 
process of a distribution with a lot of mass close to zero is sensitive to small 
changes in the observation process since the mass easily may ‘spill over’ to 
the other side of zero, and thus such a distribution gives rise to a compara¬ 
tively large volatility. On the other hand, a concentrated distribution makes 
it difficult to distinguish possible drifts from each other, and changes in the 
observation process would to a higher degree be attributed to the Brownian 
fluctuations. This implies a slow learning process, which indicates a small 
volatility. Corollary 3.8 shows that the latter effect outweighs the former 


one. 


4. Analysis of the optimal stopping problem 
In this section, we study the perpetual optimal stopping problem ([3^ and 


its finite-horizon counterpart under the integrability condition (12). Most 


of the time the emphasis is on the perpetual case, though the corresponding 
results also hold for the finite horizon case by the same arguments. If the 
analogy is straightforward, we only comment on it, otherwise, more details 
are provided. 

4.1. The value function with arbitrary starting points. Recall that 

dUt = a{t,Ut)dWt, 

where 

a{t, tt) = d27^{t, x{t, vr)) > 0 

for all {t,Tr) G [0, oo) x (0,1) (beware that 7r(-, •) is a function, while vr is a 
real number). We embed the optimal stopping problem (36), in which the 
starting point of the process ft is given by Hq := IP(i3 > 0), into the optimal 
stopping problem 

(t,7r) G [0,00) X (0,1), (22) 


v{t,TT) = inf E 5(n*+^) CT 
r£T L 

for the process 11*’'"' = n given by 

dlli+s = cr{t + s, Ili+s) dWt+s , {s > 0) 
n* = vr, 

where T denotes the set of stoppin g ti mes with respect to the completed 
filtration of {n*/[^}s>o. The SDE (^23^ possesses a unique solution since 
f7(-, •) is locally Lipschitz by Proposition 3.6 Furthermore, the embedding 


(23) 
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has a consistent interpretation also at time t = 
the remark following it. Note that choosing r 


0, which is given by (13) and 
= 0 gives v{t, tt) < < 7 ( 71 ). 


Proposition 4.1. The value function v{t,TT) is concave in tt for any fixed 
t > 0 . 


Proof. This follows by a standard approximation argument using optimal 
stopping problems where stopping is only allowed at a discrete set of time- 
points, compare [ 8 ]. 

To outline this, denote by Tt^n, where t < n, the set of stopping times in 
T taking values in {/c 2 “”', k = 0,1, n 2 "} n [ 0 , n — t], n = 1 , 2 ,..., and let 


Vn{t,7T) 


inf E 

reTt.n 


gij^t+r) + CT 


Then Vn{n,7r) = 5 ( 71 ) is concave in tt. By preservation of concavity for 
martingale diffusions, see [TT] (the results of mi extend to the current setting 
with both an upper and a lower bound on the state space), tt i— >■ tt) is 

concave also for t € {n — 2~'^,n). Next, at time t = n — 2“” the value is 
given by dynamic programming as 

Vn{t,TT) = min ( 5 ( 71 ), E [vn{n,Illi^) + c2“"-] } , 

which is concave (being the minimum of two concave functions). Proceeding 
recursively shows that Vn is concave in tt at all times t G [ 0 ,n]. Since Vn 
converges pointwise to u as n —)• 00 , this implies that also v is concave in 

TT. □ 


Proposition 4.2. The value function v{t, tt) is non-decreasing in t for every 
fixed TT G ( 0 , 1 ). 

Proof. This can be proven using approximation by Bermudan options as in 
the proof of Proposition |4.1| above. Indeed, for a fixed time t > 0 one may 
approximate v{t, tt) by the optimal value in the case when stopping times 
are restricted to take values in the set {k2~'^ : n G N, A: G {0,1,..., n2'^}}. 
Since the expected value of a concave function of a martingale diffusion is 
non-increasing in the volatility, see im , the approximation is non-decreasing 
in t by Corollary |3.8[ Letting n —)• 00 finishes the proof. □ 

Remark It is straightforward to check that the monotonicity of v in the 
time-variable also holds in cases when the rate c of the observation cost is 
increasing in time (instead of a constant as in our set-up). As the non¬ 
decreasing value function implies the monotonicity of the optimal stopping 
boundaries (see Proposition |4.5[ ) , the same monotonicity of the optimal stop¬ 
ping boundaries would be present also in the case of the observation rate c 
being increasing in time. Accordingly, we expect the subsequent results to 
extend to that case as well. 


Proposition 4.3. The value function v is continuous on [ 0 , 00 ) x [0,1]. 
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Proof. By concavity of v in the second variable together with the bounds 
0 < u < S', we have that v is Lipschitz continuous in tt for any fixed t, with 
Lipschitz coefficient 1. Thus it suffices to check that v is continuous in time. 
To do this, let t 2 > ti >0 and note that 

v{ti,7r) > > v{t 2 ,TT) -7r|], 

where the first inequality holds since E[i;(t 2 , represents the value of a 

sequential testing problem, started at ti, with the running cost of observa¬ 
tion not started until time t 2 , the second inequality holds by the concavity 
of V in the second variable and the bounds 0 < v < g. Thus 

0 < v{t 2 , tt) - v{ti,Tr) < E[|n*^’’" - 7r|]. 

Since the expected value of a convex function of a martingale diffusion 
is non-decreasing in the volatility (again by [H]) and o'(0, •) > cr(-,-) on 
[0,oo) X (0,1), we deduce that E[|n* 2 ’’^ — 7r|] < E[|n° 2 (K 4 ^ “ "^1] —^0 as 
t 2 — ti'\ 0. This finishes the proof. □ 


Lemma 4.4. ITe have v{t, 1/2) < g{l/2) for all times t > 0. 


Proof. Let t > 0, and define := [t,t -|- e] x [1/2 — (c -|- l)e, 1/2 -|- (c -|- l)e] 
for e small enough so that C [t, oo) x (0,1). Let 


Te := inf{s > 0 : (t -I- s, ^ A^} 


be the first exit time from Af.. By Proposition |3.6| and Corollary |3.8[ 
f7(-,-) is continuous and strictly positive on [0,oo) x (0,1). Thus Ug := 
inf(^ ,r)eAe c’'('S, tt) is strictly positive and non-increasing as a function of e, 
so cJe is bounded away from 0 as e —)• 0. Now, 


5(1/2) - vit, 1/2) > 1/2 - E [g(lffli^J + cr, 

> 1/2 - (1/2 - (c + l)e + ce)E(T, < e) 
= e - {c+ l)eE(re = e). 


(1/2 -k ce)E(re = e) 
(24) 


Here 


(te = e) = E I sup 
Vo<s<e 

< E( sup 

Vo<s<e 


JO 


a(t + u,nl’^i^)dWt+n 


< (c + l)e 


< (c -t- l)e 


< 


sup Wa<{c + l)e/(Tf 

0<s<e 


(25) 


as e —>■ 0, where the first inequality follows from HH Lemma 10]. Conse¬ 
quently, (24) and (25) yield that ^(1/2) — i;(t, 1/2) > 0, which finishes the 
proof of the claim. □ 
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4.2. The structure of an optimal strategy. Recalling that 0 < v{t, vr) < 
g{7r), we denote by 

C := {{t, tt) E [0, oo) X (0,1) : v{t, vr) < g{7r)} 
the continuation region, and by 


V := {{t, tt) E [0, oo) X (0,1) : v{t, vr) = g{7r)} 


the stopping region. Since v is continuous, C is open and V is closed. Resort¬ 
ing to intuition from optimal stopping theory, we expect that the stopping 
time 


:= inf{s > 0 : (t -|- s. 


G V} 


(26) 


is an optimal stopping time in (22). (Note that standard optimal stopping 
theory does not apply since the pay-off process is not uniformly integrable.) 
The optimality of t* is verified below, see Theorem 4.6 


Proposition 4.5. There exist two functions bi : [0,oo) —>■ [0,1/2) and 
b 2 : [ 0 ,oo) —)• ( 1 / 2 , 1 ] such that 

C = {(t, 7 r) : bi{t) < TT < b 2 {t)}. 


The function bi is non-decreasing and right-continuous with left limits. Sim¬ 
ilarly, b 2 is non-increasing and right-continuous with left limits. 


Proof. The existence of bi and 62 follows from the concavity of v and Lemma [4^ 
The monotonicity properties are immediate consequences of Proposition |4.2[ 
Moreover, by the continuity of v, the function bi is upper semi-continuous 
and 62 is lower semi-continuous. Hence, they are right-continuous with left 
limits. □ 


Let us also consider the same optimal stopping problem with a finite 
horizon T > 0. It is written as 

.Tu ^ \g{Ill’^ 


{t, tt) = inf E 
reTr-t 


H+r) + cr 


(27) 


where Tr-t denotes the set of stopping times less or equal to T — t with 
respect to the completed filtration of {nj/[g}s>o. Note that all results for 
the perpetual problem ( 22 ) described above in this section also hold for the 
finite horizon problem (35), with the obvious modifications regarding the 
time horizon, by the same proofs. Moreover, the pay-off process in (35) 
is continuous and bounded, so standard optimal stopping theory (see, for 
example, m Corollary 2.9 on p. 46]) yields that 

:= inf{s > 0 : H*/)^ ^ {bj{t + s), b^{t + s))} 

is an optimal stopping time in (35), where bj and 63 ^ are the corresponding 
boundaries enclosing the finite-horizon continuation region 

C'^ := {(t, vr) E [0, T) x (0,1) : v^{t, vr) < g{TT)}. 

The infinite-horizon problem can be approximated by finite-horizon prob¬ 
lems in the following sense. 
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Theorem 4.6. The functions \ v, bj \ bi, and b^ 62 pointwise as 
T 00. The stopping times t* a.s. as T 00, where t* is defined 

in ( 26 ). Moreover, t* is optimal in ( 22 ). 


Proof. Since > v, we have that bi < < 62 and < r* 


11^ /II TTT/^ 

By bounded and monotone convergence, v'^{t,7r) \ v{t,Tr) pointwise as 
T —)■ 00, so 6^ \ 61 and 62 pointwise as T —)■ 00. Consequently, by 

the monotonicity of bi and bf , it follows that t* a.s. Thus 


E 


ff(n 


tr^)+CT* 


v'^it, tt) = E + ct'^ 

by bounded and monotone convergence. By uniqueness of limits 

u(t,7r)=E + cr 

so T* is optimal. 


□ 


4.3. Optimal stopping boundaries and the free-boundary problem. 

Proposition 4.7. The boundaries bi and 62 satisfy 0 < bi{t) < 1/2 < 
62(t) < 1 for all times t >0. 


Proof. The two middle inequalities are granted by Lemma 4.4 To see that 
61 > 0 on [0,00), without loss of generality, it is sufficient to show that 
61 > 0 on (0,00); this is due to the possibility provided by ( 13 ) to start 
the process 11 slightly earlier. Let us assume, to reach a contradiction, that 
bi(t) = 0 for some t > 0 . Then, by monotonicity, 61 = 0 on [0,t]. By the 
martingale inequality. 


T 


< t) < E ( sup > 1/2 ) < 27 r. 

\0<s<t ) 


Consequently, 

E[r*] > t(l - 27r), 

and so 0 = lim^^^o 9 '(^) ^ hm 7 r\o'i'(0, tt) > cE[r*] > ct > 0, which is a 
clear contradiction. Therefore 61 > 0 at all times. The proof that 62 < 1 is 
analogous. □ 

Proposition 4.8. The triplet {v,bi,b 2 ) satisfies the free boundary problem 

div{t, vr) -|- ^^^fp-d 2 v(t, tt) -|- c = 0 6i(t) < vr < b 2 {t) 
v{t,Tr)=7r 77 < ^i(t) (28) 

v{t,7r) = 1 — TT TT > b 2 {t). 

Moreover, the smooth-fit condition holds in the sense that the function vr 1 —?■ 
v{t,7r) is for all t>0. 


Proof. The proof that the differential equation in ( 28 ) holds is based on the 
strong Markov property and the continuity of v. However, the procedure is 
standard and we therefore omit the argument, referring to the proof of m 
Theorem 7 . 7 ] for the details instead. The value of u for vr ^ {bi{t),b2{t)) 
follows from concavity and the definition of 61 and 62- 
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For the smooth-fit condition, note that the value function vr i—)> v{t, vr) is 
continuous on (0,1) and for tt G (&i(t), ^(i)) as well as for ir G (0, 6i(t))U 
Thus it remains to check the property at bi{t) and b 2 {t). To 
prove the property at bi{t) (the property at 62 (i) being completely 
analogous), note that since v is concave in vr, it suffices to show that 


1 -^ vit,b{t) + e) - v{t,b{t)) 
eto e 


(29) 


Without loss of generality, we do this for t = 0, letting tt = 61 (0). 

Let e G (0,1/2 — vr) and denote by the first hitting time of the stopping 
region for Then 


i;( 0 , vr + e) - u( 0 , vr) > E - gO^f) 


> e - 2E 


(n“ 


-n)!f)i. 




Thus, to prove (29) it suffices (by the Cauchy-Schwartz inequality) to show 
that 


E 


(n' 


0,7r+e 


-n' 


0,7r\2 




1/2) = o(e 2 ) 


(30) 


as e —)• 0. To do this, first assume that a is Lipschitz continuous in tt on any 
compact time interval, and define 


h{t) := E 



0,7r+e 



Fixing T > 0, for t G [0, T] we have 


h{t) = E 


\ ■ 

e + cT(s,n°’-+^)-c 7 (s,n°’-)dW.J 


< / E 

Jo 


D{Tf (n°’-+^-n°’-)' 


ds 


< + D{Tf / h{s)ds, 


where D(T) is a Lipschitz constant for a on [0,T] x (0,1). Consequently, 
Gronwall’s inequality yields 

l2n 


hiT) < 


(31) 


Next, denote by f{y) := tt — j^{y — vr) the affine function satisfying 
/(vr) = TT and /(I) = 0, and note that f < g on [vr, 1]. Therefore, 


cE[r^] =u(0,7r + e)-E[ff(n°f+^)] 

< g{7r + e) - E[/(n°f+^)] 

= TT + e — {tt — - - — tt]) 

1 — TT 

= e/(l - tt), 
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where the inequality follows from the monotonicity of 6 i and the last equality 
by optional sampling. Thus, writing L) = l/(1 — vr), we have 

IP(Te >T)< DelicT). (32) 

Moreover, writing 

T7r,i/2 '■= inf{s > 0 : i (tt, 1/2)}, 

we have 

> 1/2) < P = 1/2) < (33) 


2 


for C = 2/(1 —27r), where we used the martingality of IT to obtain the second 
inequality. Putting together (31), (32) and (33) yields 


E 


< E 


(n°f- n' 

0,7r+e 


0,7r\ 2 




1 / 2 ) 




+ E(re > T 




(n^. -Hr. 

< + D/(cT)). 

Given <5 > 0, it is possible to choose T large enough so that CD/(cT) < 5/2, 
and then to choose e > 0 small enough so that Cee^ < 5/2. This proves 


(30) and thus finishes the proof of the smooth-fit property if a is Lipschitz 


in vr, locally uniformly in t. 

For a general a, due to the regularity of a on [0,oo) x (0,1), one can 
find another volatility function a that is Lipschitz continuous in vr on any 
given compact interval in time, and that satisfies 0 < d < u everywhere 
and cj = (T on [0,oo) x [61 (0), 62 ( 0 )]. By monotonicity in the volatility, the 
corresponding value function v satisfies v > v. On the other hand, since 
a = a on [ 0 , 00 ) x [61 ( 0 ), 62 ( 0 )] and since t* is optimal for the volatility a, 
we also have v < v, so v = v. By the above argument, u is which finishes 
the proof. □ 

Theorem 4.9. The boundaries bi and 62 both continuous. 

Proof. Let us prove continuity of bi (the proof for 62 is analogous). We know 
that bi is right-continuous, so it suffices to assume for a contradiction that 
bi is not continuous at some time to > 0. By monotonicity, 61 (to) > ^i(to—)• 
In the continuation region, div > 0, so (28) yields 


—< —c. 

2 ^ 

Since a is locally bounded away from zero, this means that on each com pact 
set we can find some constant d > 0 such that d^v < —d. By Proposition |4.8| 
the map vr 1 —)• v{t,Tr) is on [ 6 i(t), 52 (t)] for any t > 0 , so for t < to and 
61 (t) < TT < 5i(to), we have 

pir pw 

vit,Tr) - gin) = / di(v - g){t,u) dudw 

< —d(Tr — 6 i(t))^/ 2 . 
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Choosing tt = )+bi(to) letting t ^ to gives 

This contradicts the assumption that {to, belongs to the stop¬ 
ping region, so bi has to be continuous. □ 


Remark Even though, in this section, all the results are formulated for the 
perpetual problem ( 22 ), it is straightforward to check that the corresponding 
results for the finite-horizon problem (35) also hold. In that case, the bound¬ 
aries bj : [ 0 , 1 ] —)• ( 0 , 1 ) and : [ 0 , 1 ] —)• ( 0 , 1 ) are continuous and monotone, 
with 0 < 6 ^ < 1/2 < 6 ^ < 1 on [0,T) and bJ{T) = b'^{T) = 1/2. Also, the 
assertions of Proposition 
in place of {v, bi, 62 )- 


4.8 


hold for {v^ , , 62 ) on the time interval [0, T) 


5. Integral equations for the boundaries 


It is well-known that optimal stopping boundaries, under some condi¬ 
tions, can be characterized by certain integral equations, compare 110 ] and 
[19j . In this section, we study the integral equations for the optimal stop¬ 
ping boundaries arising in our sequential testing problem. For the problem 
(35) with finite horizon, a pair of integral equations is shown to completely 
characterise the optimal stopping boundaries within the class of continu¬ 
ous solutions. The situation in the perpetual case is more delicate, and 
uniqueness of solutions remains an open question. 


5.1. A pair of integral equations for the finite-horizon boundaries. 


Theorem 5.1. Assume that T < 00 . Then the pair (6^,6^) is the unique 
continuous solution of 


ci(t)=E +cf^ ^ F(ci(t + u) < < C 2 {t + u)} du 


(34) 


1 - C2(t) = E 5^(11^'^^*'*^) + cf^ * E(ci(t -I- u) < < C2(t + u)) du 

such that 0 < Cl (t) < 1/2 < C 2 (t) < 1 for all t G [0, T]. 


Proof. For {t, tt) G [0, T) x (0,1) and e > 0 small enough, let 

Te := inf{s > 0 : ^ (e, 1 - e)} A (T - t - e). 

Applying Ito’s formula (more precisely, an extension of Ito’s formula, see 
[181 Theorem 3.1 and Remark 3.2], which can be applied thanks to the 
monotonicity of b'^ and b^) to the process v^{t -|- s A r^, and then 

taking expectations yields 
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Since —)• T — t as e —)• 0, it follows from Proposition 4^ and bounded 

convergence that 


E 


=v^{t,7r)-c 


rT-t 


{t + u) < + u)) dn.(35) 


Plugging in vr = b"[{t) and tt = shows that {bj^b^) solves (34). 

For uniqueness, assume that (ci, C 2 ) is another continuous solution to (34) 
with 0 < ci(t) < 1/2 < C 2 {t) < 1, and define 

rT-t 


V{t,n) :=E 5(4^) 


+ c 


E(ci(t + u) < < C 2 {t + u)) dM.(36) 


Then V{t,ci{t)) = ci{t) and V{t,C 2 {t)) = 1 — C 2 {t) by ( [34| ), and P(T, vr) = 
g{'K). Moreover, by the Markov property, the process 

:= + c f l(ci(i+ii),c 2 (t+M))(nt+„) dn 

40 

is a martingale for any (t, tt). In particular, the process 

Ms := v^{t + + cj^ du 

is also a martingale. 

Claim 1 : V{t, it) = g{7r) for vr ^ (ci(t), C 2 (t)). 

Assume that vr < ci(t) (the case tt > C 2 {t) is similar), and let 

7 c := inf{s > 0 : > ci(t + s)} A (T - t). 

Then 


P(t,7 r)=E Vit + ^c,Ill+-yJ 


= E 


n 


t, 7 T 

t+^c 


= TT = g{TT), 


whith the first equality being justified by optional sampling and the martin¬ 
gale property of M, the second by (34), and the third by optional sampling 
and the martingale property of IT. 

Claim 2 : V > . 

Take (t, tt) such that ci(t) < tt < C2(t), and let 

Tc := inf{s > 0 : ^ (ci(t -|- s),C 2 {t + s))} A (T - t). 

Then 


P(t,7r)=E[p(t + Tc,n‘/^,J 


- cE 


l(ci(t+M),C2(l+tt)) du 


= E 


L^o 

.T/ 


aO^t+rJ + cE [Tc] >V^{t,7r). 


From this and Claim 1, Claim 2 follows. 

Claim 3 : bj < ci and C 2 < 

Assume that bj{t) > ci{t) for some t. Choose tt = ci(t), and let 
7 fe := inf{s > 0 : FIj/)^ >bj'{t + s)} A{T -t). 
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Then, by right-continuity of bj and ci, 


E 


nb 


l(ci(t+li),C2(l+'!i)) (n^+\) dn 


> 0 . 


( 37 ) 


On the other hand, by optional sampling and martingality of M and M we 
have 


0 = V{t,7r) — v'^{t,Tr) 


= E 




t.TT 


-l-cE 


nb 


'^{ci(t+u),C2{t+u)) (^^^JdR 


Ljo 


Since 1^ > u by Claim 2, this contradicts (37) and thus bJ < ci. The claim 
C2 < b 2 is proved similarly. 


Now, it follows from ( |35| ), (36) and Claim 3 that V = . Therefore, since 

< g for vr G {bJ {t),b^t)) it follows from Claims 1 and 3 that bJ = ci 


and bn = C2, which finishes the proof. 


□ 


Remark A closer inspection of the proof above shows that uniqueness holds 
in the larger class of pairs (ci,C2) such that ci is right-continuous with no 
negative jumps and C2 is right-continuous with no positive jumps. 

5.2. A pair of integral equations for the infinite-horizon bound¬ 
aries. 

Theorem 5.2. The pair ( 61 , 62 ) is a solution of 

( bi{t) = c F{bi{t + u) < < b 2 {t + u)) du 

I 1 - b 2 {t) = c E(6i(t -I- rt) < < b 2 {t + u)) du. 

Proof. For fixed T > 0, an application of Ito’s formula as in the preceding 
proof gives 


E 


v(T, = v{t, tt) — c I E(6i(t -I- rt) < < 62(6 -|- u)) du. 


rT-t 


t.TT 


Since v is bounded and 11^ converges to either 0 or 1 as T —)• 00 by Propo¬ 
sition 3.5, we find that 

poo 

v{t, tt) = c / E(6i(t -I- u) < < b 2 {t + u)) du. 

Jo 


Plugging in vr = 61 (t) and vr = b 2 {t) shows that (61,62) solves (38). □ 

Remark The main technical difficulty when trying to apply the uniqueness 
proof of Theorem 5.1 to the perpetual problem lies in the lack of a straight¬ 
forward extension of the optional sampling theorem to unbounded, possibly 
infinite stopping times. 
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5.3. The case of a symmetric volatility function. Now assume that the 
volatility function is symmetric about vr = 1/2, i.e. a{t, tt) = a{t, 1—vr). This 
is the case, for example, if the prior distribution // is symmetric about zero 
in the sense that /i([0,a)) = ^((—a,0)) for all a > 0. Then, by symmetry, 
bj = 1 — 62 , and we set 6 ^ := bj. The following result is a straightforward 


consequence of Theorem 5.1 


Theorem 5.3. Assume that a is symmetric about tt = 1/2. Then the 
boundary b^ is the unique continuous solution of 


c{t) = E 



T > 



E(c(t + n)<n‘:/W 


<1 — c{t + u)) dn(39) 


such that 0 < c{t) <1/2 for all t G [ 0 ,T]. 


Remark Although not necessarily symmetric, all normal prior distributions 
as well as all two-point priors give rise to symmetric volatilities, compare 
Section [3l 


6. Long-term asymptotics of the volatility and the boundaries 

Since the boundaries bi and 62 are monotone, the limits 6 j(oo) := limt_^oo hit), 
i = 1,2, exist with 5i(oo) < 1/2 and b 2 it) > 1/2. In this section we de¬ 
termine these limits. To do that, we first derive a few limiting properties 
of level curves as well as study the limit cr(cx),7r) := limt_^oo '^) of the 
volatility. 

Let us define 


-- inf{s > 0 

: /i ([s, s -|- e)) > 0 for all e > 0 } 

(40) 

sup{s < 0 : 

: /i ((s — e, s]) > 0 for all e > 0 }. 

(41) 


We write m = (I + r)/2 for the midpoint between I and r. 

The following proposition will serve as a useful device for understanding 
long-term volatility. 

Proposition 6.1. 

1. If a > m, then 7 r(t, at) —)• 1 as t ^ 00 . 

2. If a < m, then 7 r(t, at) —)• 0 os t —)> 00 . 

Proof. Given a G M, define 

. . J(-oo,0) exp(-( 6 -a) 2 i)//(d 6 ) 

/[0,oo)exp(-(fc-a)2i)p(d6) ’ 

so that 7 r(t, at) = 1/(1 -|- hit)). We will prove the claims in two different 
cases separately. 

(i) First case: I < r. 
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1. First note that, in view of Proposition |3.4[ it suffices to treat the 
case a G (m, r). For such a, fix 7 > r such that 7 — a < a — l. Then 

, ,,, ^ /(-oo.o) ^ 

Ht) < -- \2tY7 -TTm-^ ^ 

exp(-(7-a)^2) Jjo ,^]/i(d 6 ) 

as t —)> 00 . Hence 7 r(t, at) —>■ 1 as t —)> 00 . 

2. For the second result, suppose that a < m, and note that it suffices 

to treat the case a G {I, m). Let 7 < t be such that a — < r — a. 

Then 

exp (-(a - 7 ) 2 i) //(d5) 


h{t) > 


00 


exp(-(r-a) 2 i) t^(d 6 ) 

as t —)■ 00. Hence 7 r(t, at) —>• 0 as t —)■ 00. 

(ii) Second case: I = r = 0. 

1. Assume that a > 0, and let e > 0. Then 

^ exp (-(g + e) 2 i) //(d 6 ) + exp (-a 2 i) ^{db) 

/[o,a] exp (- 022 )/^(d5) 

^([0,a]) 

as t —>■ 00. Thus, since e > 0 is arbitrary and /x([—e, 0)) —)• 0 
as e —>■ 0, we conclude that /i(t) —>■ 0 as t —)• 00. Consequently, 
7 r(t, at) —>■ 1 . 

2. Next, assume that a < 0. Choosing 7 G (a,0) with fj,{{a,'y)) > 0, 
we find that 

exp(-(a- 7 ) 2 i)/(^,^^^(d 6 ) 


h{t) > 


00 


exp(-a2i)/jg_^^/i(d6) 

as t —>■ 00 . Consequently, 7r(t, at) —)• 0 as t —)• 00 , which finishes the 
proof. 

□ 


Remark Notice that Proposition | 6 . 1 | implies that for any fixed value tt, the 
corresponding level curve x(-, vr) satisfies limi^oo(ct —x(t, tt)) = 00 if a > m, 
and limt_,.oo(at — x(t, vr)) = —00 if a < m. 

6 . 1 . Long-term behaviour of the volatility. Now, we are in a position 
to determine the limit <7(00,71) := limi_,.oo ( 7 (t, vr) of the volatility as time 
increases. 


Proposition 6.2. The long-term limit of volatility satisfies cr(oo,7r) = (r— 
t) 7 r(l — tt). 

Remark Note that if / = r = 0 , then the volatility converges to zero as 
time tends to infinity. Also, note that if Z < r, then the volatility tends to 
the volatility from the case of a two-point prior distribution. 
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Proof of Proposition fS.Sl We first claim that 

—)• 7rr 

as t —)• oo. To see this, suppose that a > r and take 7 G (t, a) such that 
7 — r < a — 7 . By Corollary |3.3| and Proposition |6.1[ for all large enough t, 

IEt,x(t, 7 r) [-Bl(a,oo)(^)] [Bl(a,oo)iB)] 

^ exp (-(a - 7)7) f(a.oo) 

exp (-(7 - r)7) (i(|r,7)) 

which tends to 0 as t —)• 00 . Now, the fact that ^t,x(t,n){B G [ 0 ,r)) = 0 for 
all t > 0 finishes the claim. 

Next, straightforward modifications of the arguments above show that 

as t —)■ OO. Since 

cr{t,7r) = (1 - [^l(-oo, 0 )(-B)] , 

this finishes the proof. □ 

Remark Similar arguments as in the proof above show that fJ-t,x{t,TT) 

(1 — 7r)6i + 7T6r as t —>• 00 . Thus, along a level curve x(-, 7 r) the conditional 
distribution of B converges weakly to the two-point distribution with mass 
TT at r and mass 1 — tt at Z. 


6.2. Long-term behaviour of the boundaries. 

Theorem 6.3. 

• lfl = r = 0, then 61 ( 00 ) = 62 ( 00 ) = 1 / 2 . 

• Ifl<r, then 61 ( 00 ) = 6 ^“* and 62 ( 00 ) = 62 ”^, where < 1/2 < 
62 ”* are the optimal boundaries for a two-point prior distribution 
with mass at points separated by 0 and at a distance r — I from each 
other. 


Proof. Since the volatility cr(-,-) is non-increasing in time. Proposition 6.2 


and Dini’s theorem yield that a{t, ■) converges to <7(00, vr) = (r — Z) 7 r(l — tt) 
uniformly on the compact interval [ 6 i( 0 ), 62(0)] as t ^ 00. Therefore, given 
e > 0 we can find to large enough so that a{to, tt) < (e -|- r — Z)7r(l — vr) for 
TT G [61(0), 62(0)]. Define 

d'(t,TT') . (T(t, vr) (^)) 

and denote by v the corresponding value function. Since the optimal stop¬ 
ping problem ( 22 ) is monotone in the volatility (compare e.g. [Ill Lemma 
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10]), we have that v > v. On the other hand, since a = a on the contin¬ 
uation region {(t, tt) : bi{t) < vr < b 2 {t)}, we also have v < v, so v = v. 
Moreover, by monotonicity in the volatility, 

V^+r-l <y=y< yr-l 

on [to^oo) X ( 0 , 1 ), where denotes the value function corresponding to 
a volatility function a 7 r(l — vr). Since the value function v is squeezed in 
between the value functions and from time to, the optimal stop¬ 

ping boundaries bi and 62 are squeezed in between the corresponding optimal 
stopping boundaries for and By inspection of the explicit for¬ 

mulas in the two-point distribution case, see m Theorem 21.1], the gaps 
b\~'' — bl~^^~’' and — 62 “^ between the boundaries vanish as e —)■ 0 , 

which finishes the proof. 

□ 

Remark It is also of interest to determine 6i(0) for z = 1,2 in order to 
find the best bounds for the continuation region. It seems difficult to deter¬ 
mine these quantities in general, but an upper bound for the continuation 
region initially (and thus at all times) can be established by solving the free¬ 
boundary problem for the time-homogeneous volatility it(0, tt). However, we 
expect these bounds to be rather crude, and therefore do not provide any 
details. 


7. The normal prior distribution 


In this final section, we study the case of a normal prior distribution 
in further detail. In particular, we show that the kernel in the integral 
equations determined in Section can be calculated explicitly for normal 
priors. For similar results in the case of the two-point distribution, see [H]. 

First, recall from Section that a normal prior distribution with mean 
m and variance 7 ^ leads to a volatility surface ( 7 (-,-) that is symmetric 
around the line vr = 1 / 2 . As a result, the stopping boundaries 61 and 62 
are also symmetric around vr = 1/2 with 62 (t) = 1 — bi{t), so it suffices 
to solve a single integral equation to determine both boundaries. Next, 
recall that the conditional distribution is normal with standard deviation 
7 (t) := 7 / Y^l -I- t 7 ^. Consequently, the x-value that gives Tr{t,x) = b{t) is 
such that the conditional drift equals 


m{t) := <I> ^{b{t))'y/\/1 -h 

Now, given s > 0, let Y denote a N{m{t)s, s-|-s^ 7 ^(t))-distributed random 
variable. Then using p^, we calculate 


K{t,s,b{t),b{t + s)) :=I 

= P ^ 6 (t -|- s) < 

= ^(d2) - Hdi), 


-|- s) < < 1 — b{t + s 

( m(t)-I- 7 ^(t)y 


V7(i)\/l + 'S7^(i), 


< 1 — b{t + s) 
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where 


and 


:= 


do := 


+ s))7(t)\/l + + s'y^(t)) 


Thus the kernel K appearing in the integral equation (39) and in the corre¬ 
sponding equation for the infinite-horizon formulation is explicit. 



Figure 1. The boundaries 6i and 62 calculated numerically 
for the case of A^(m, l)-prior (note that the boundaries do 
not depend on m G M.) and the cost of observation c = 0.5 
per unit time. 
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